Tree Dependent Identically Distributed Learning

نویسندگان

Tony Jebara

Philip M. Long

چکیده

We view a dataset of points or samples as having an underlying, yet unspecified, tree structure and exploit this assumption in learning problems. Such a tree structure assumption is equivalent to treating a dataset as being tree dependent identically distributed or tdid and preserves exchangeability. This extends traditional iid assumptions on data since each datum can be sampled sequentially after being conditioned on a parent. Instead of hypothesizing a single best tree structure, we infer a richer Bayesian posterior distribution over tree structures from a given dataset. We compute this posterior over (directed or undirected) trees via the Laplacian of conditional distributions between pairs of input data points. This posterior distribution is efficiently normalized by the Laplacian’s determinant and also facilitates novel maximum likelihood estimators, efficient expectations and other useful inference computations. In a classification setting, tdid assumptions yield a criterion that maximizes the determinant of a matrix of conditional distributions between pairs of input and output points. This leads to a novel classification algorithm we call the Maximum Determinant Machine. Unsupervised and supervised experiments are shown.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Note on the Strong Law of Large Numbers

Petrov (1996) proved the connection between general moment conditions and the applicability of the strong law of large numbers to a sequence of pairwise independent and identically distributed random variables. This note examines this connection to a sequence of pairwise negative quadrant dependent (NQD) and identically distributed random variables. As a consequence of the main theorem ...

متن کامل

Higher moments portfolio Optimization with unequal weights based on Generalized Capital Asset pricing model with independent and identically asymmetric Power Distribution

The main criterion in investment decisions is to maximize the investors utility. Traditional capital asset pricing models cannot be used when asset returns do not follow a normal distribution. For this reason, we use capital asset pricing model with independent and identically asymmetric power distributed (CAPM-IIAPD) and capital asset pricing model with asymmetric independent and identically a...

متن کامل

Simulation of (M1, M2)-dependent random fields with K-distributed marginals

Amethod to simulate a two-dimensional (m1,m2)-dependent random field Y with K-distributed marginals is presented. The simulation starts with a random field with independent and identically standardized normally distributed elements. Then a (m1,m2)-dependent matrix is calculated using weighted sums. It has identically standardized normally distributed marginals. From this matrix the desired rand...

متن کامل

Time Series Models

Overview In contrast to the classical linear regression model, in which the components of the dependent variable vector y are not identically distributed (because its mean vector varies with the regressors) but may be independently distributed, time series models have dependent variables which may be identically distributed, but are typically not independent across ovbservations. Such models ar...

متن کامل